Abstract

The relationship between the size and the variance of firm growth rates is known to follow an approximate power-law behavior $\sigma(S) \sim S^{-\beta(S)}$, where S is the firm size and $\beta(S) \approx 0.2$ is an exponent weakly dependent on S. Here we show how a model of proportional growth, which treats firms as classes composed of a variable number of units of variable size, can explain this size-variance dependence. In general, the model predicts that $\beta(S)$ must exhibit a crossover from $\beta(0) = 0$ to $\beta(\infty) = 1/2$. For a realistic set of parameters, $\beta(S)$ is approximately constant and can vary in the range from 0.14 to 0.2 depending on the average number of units in the firm. We test the model with a unique industry-specific database in which firm sales are given in terms of the sum of the sales of all their products. We find that the model is consistent with the empirically observed size-variance relationship.

I. INTRODUCTION



Gibrat was probably the first to notice the skew size distributions of economic systems [1]. As a simple candidate explanation he postulated the "Law of Proportionate Effect", according to which the expected value of the growth rate of a business firm is proportional to the current size of the firm [2]. Several models of proportional growth have subsequently been introduced in economics [3, 4, 5, 6]. In particular, Simon and colleagues [7], [8] examined a stochastic process for Bose-Einstein statistics similar to the one originally proposed by Yule [9] to explain the distribution of sizes of genera. The Simon model is a Polya Urn model in which Gibrat's Law is modified by incorporating an entry process of new firms. In Simon's framework, the firms capture a sequence of many independent "opportunities" which arise over time, each of size unity, with a constant probability b that a new opportunity is assigned to a new firm. Although the Simon model builds upon Gibrat's Law, it leads to a skew distribution of the Yule type for the upper tail of the size distribution of firms, while the limiting distribution of the Gibrat growth process is lognormal. The Law of Proportionate Effect implies that the variance $\sigma^2$ of firm growth rates is independent of size, while according to the Simon model it is inversely proportional to the size of business firms. Hymer, Pashigian and Mansfield [10], [11] noticed that the relationship between the variance of growth rate and the size of business firms is not null, but that the variance decreases with increasing firm size by a factor less than the 1/K we would expect if firms were a collection of K independent subunits of approximately equal size. In a lively debate in the mid-Sixties, Simon and Mansfield [12] argued that this was probably due to common managerial influences and other similarities of firm units, which imply that the growth rates of such components are positively correlated. On the contrary, Hymer and Pashigian [13] maintained that larger firms are riskier than expected because of economies of scale and monopolistic power. Following Stanley and colleagues [14], several scholars [15], [16] have recently found a non-trivial relationship between the size of the firm S and the variance $\sigma^2$ of its growth rate:

$$\sigma(S) \sim S^{-\beta(S)}, \qquad (1)$$

with $\beta \approx 0.2$.

Numerous attempts have been made to explain this puzzling evidence by considering firms as collections of independent units of uneven size [14, 16, 17, 18, 19, 20, 21, 22], but existing models do not provide a unifying explanation for the probability density functions of the growth and size of firms as well as the size-variance relationship.

Thus, the scaling of the variance of firm growth rates has been considered to be a crucial unsolved problem in economics [23]. Recent papers [29] provide a general framework for the growth and size of business firms based on the number and size distribution of their constituent parts [16], [17], [18]. Specifically, Fu and colleagues [25] present a model of proportional growth in both the number of units and their size, drawing some general implications on the mechanisms which sustain business firm growth. According to our model, the probability density function (PDF) of the growth rates is Laplace in the center with power law tails. The PDF of the firm growth rates is markedly different from the lognormal distribution of the growth rates predicted by Gibrat and comes from a convolution of the distribution of the growth rates of constituent units and the distribution of the number of units in economic systems. The model by Fu and colleagues [25] accurately predicts the shape of the size distribution and the growth distribution at any level of aggregation of economic systems. In this paper we derive the implications of the model for the size-variance relationship. In principle, the predictions of the model can be studied analytically; however, due to the complexity of the resulting integrals and series, which cannot be expressed in elementary functions, we rely in our study on computer simulations. The main conclusion is that the relationship between the size and the variance of growth rates is not a true power law with a single well-defined exponent $\beta$ but undergoes a slow crossover from $\beta = 0$ for $S \to 0$ to $\beta = 1/2$ for $S \to \infty$.



II. THE MODEL 



We model business firms as classes consisting of a random number of units of variable size. The number of units K is defined as in the Simon model. The size of the units $\xi$ evolves according to a geometric Brownian motion or Gibrat process [1].

As in the Simon model, business firms are classes consisting of a random number of units [17], [18], [30]. Firms grow by capturing new business opportunities, and the probability that a new opportunity is assigned to a given firm is proportional to the number of opportunities it has already captured. At each time t a new opportunity is assigned.

With probability b, the new opportunity is taken up by a new firm, so that the average number of firms at time t is $N(t) = N(0) + bt$.

With probability $1 - b$, the new opportunity is captured by an active firm $\alpha$ with probability $P_\alpha = (1 - b)K_\alpha(t)/t$, where $K_\alpha(t)$ is the number of units of firm $\alpha$ at time t.
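
The entry and capture rules above can be sketched as a short simulation. This is only an illustration under stated assumptions: the function name `simulate_simon`, the choice of one unit per initial firm, and the normalization by the current total number of units (rather than by t) are ours and are not taken from the original model specification.

```python
import random


def simulate_simon(n_steps, b, n_firms0=1, seed=0):
    """Assign n_steps unit-size opportunities; return the number of units per firm.

    Assumption: every initial firm starts with exactly one unit.
    """
    rng = random.Random(seed)
    units = [1] * n_firms0            # units[f] = K of firm f
    owners = list(range(n_firms0))    # owners[j] = firm that holds unit j
    for _ in range(n_steps):
        if rng.random() < b:
            # Entry: the opportunity is taken up by a new firm.
            units.append(1)
            owners.append(len(units) - 1)
        else:
            # Capture: picking a random existing unit selects its firm
            # with probability proportional to the firm's number of units.
            firm = rng.choice(owners)
            units[firm] += 1
            owners.append(firm)
    return units
```

Sampling a firm through a uniformly chosen unit avoids recomputing the capture probabilities at every step, which keeps the sketch linear in the number of opportunities.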

In the absence of the entry of new firms ($b = 0$) the probability distribution of the number of units in the firms at large t, i.e. the distribution P(K), is exponential:

$$P(K) \approx \frac{1}{K(t)} \exp\left(-K/K(t)\right), \qquad (2)$$

where $K(t) = [n(0) + t]/N(0)$ is the average number of units in the classes, which grows linearly with time.

If $b > 0$, P(K) becomes a Yule distribution which behaves as a power law for small K:

$$P_{\mathrm{new}}(K) \sim K^{-\varphi}, \qquad (3)$$

where $\varphi = 2 + b/(1 - b) > 2$, followed by the exponential decay of Eq. (2) for large K with $K(t) = [n(0) + t]^{1-b}\, n(0)^{b} / N(0)$. This model can be generalized to the case when the units are born at any unit of time $t'$ with probability $\theta$, die with probability $\lambda$, and in addition a new class consisting of one unit can be created with probability $b'$, by letting $t = t'(\theta - \lambda + b')$ and probability $b = b'/(\theta - \lambda + b')$.

In the Simon model opportunities are assumed to be of unit size so that $S_\alpha(t) = K_\alpha(t)$. On the contrary we assume that each opportunity has a randomly determined but finite size. In order to capture new opportunities firms launch new products and open up new establishments, divisions or units. Each opportunity is taken up by exactly one firm and the size of the firm is measured by the sum of the sizes of the opportunities it has taken up. Fig. [1] provides a schematic representation of the model.

In the following we consider products as the relevant constituent parts of the companies and measure their size in terms of sales. The model can be applied to alternative decompositions of economic systems into relevant subunits (e.g. plants) and measures of their sizes (e.g. number of employees).

At time t, the size of each product $\xi_i(t) > 0$ is decreased or increased by a random factor $\eta_i(t) > 0$ so that

$$\xi_i(t) = \xi_i(t-1)\,\eta_i(t), \qquad (4)$$

where $\eta_i(t)$, the growth rate of product i, is an independent random variable taken from a distribution $P_\eta(\eta_i)$, which has finite mean and standard deviation. We also assume that $\ln \eta_i$ has finite mean $\mu_\eta \equiv \langle \ln \eta_i \rangle$ and variance $V_\eta \equiv \langle (\ln \eta_i)^2 \rangle - \mu_\eta^2$.



Thus at time t a firm $\alpha$ has $K_\alpha(t)$ products of size $\xi_i(t)$, $i = 1, 2, \ldots, K_\alpha(t)$, so that its total size is defined as the sum of the sales of its products, $S_\alpha(t) \equiv \sum_{i=1}^{K_\alpha} \xi_i(t)$, and its growth rate is measured as $g = \log(S_\alpha(t)/S_\alpha(t-1))$.
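
As a minimal sketch of the definitions above, the following code applies one multiplicative step of Eq. (4) to every product and computes the resulting firm growth rate g. The function names and the numbers in the usage note are hypothetical and chosen only for illustration.

```python
import math


def gibrat_step(unit_sizes, growth_factors):
    """Eq. (4): each unit size is multiplied by its own random factor."""
    return [xi * eta for xi, eta in zip(unit_sizes, growth_factors)]


def firm_growth_rate(unit_sizes, growth_factors):
    """g = log(S(t) / S(t-1)), with S the sum of the unit sizes."""
    s_prev = sum(unit_sizes)
    s_next = sum(gibrat_step(unit_sizes, growth_factors))
    return math.log(s_next / s_prev)
```

For a firm with products of sizes 100, 10 and 1 and growth factors 1.0, 2.0 and 0.5, the firm grows from 111 to 120.5. The large product dominates the sum, so the doubling of the middle product moves the firm by less than ten percent.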

The probability distribution of firm growth rates P(g) is given by

$$P(g) \equiv \sum_{K=1}^{\infty} P(K)\, P(g|K), \qquad (5)$$

where P(g|K) is the distribution of the growth rates for a firm consisting of K products. Using the central limit theorem, one can show that for large K and small g, P(g|K) converges to a Gaussian distribution

$$P(g|K) \approx \sqrt{\frac{K}{2\pi V}}\, \exp\left(-\frac{(g - m)^2 K}{2V}\right), \qquad (6)$$

where V and m are functions of the distributions $P_\xi$ and $P_\eta$. For the most natural assumption of the pure Gibrat process for the sizes of the products these distributions are lognormal:

$$P_\xi(\xi_i) = \frac{1}{\sqrt{2\pi V_\xi}}\, \frac{1}{\xi_i}\, \exp\left(-(\ln \xi_i - m_\xi)^2 / 2V_\xi\right), \qquad (7)$$

$$P_\eta(\eta_i) = \frac{1}{\sqrt{2\pi V_\eta}}\, \frac{1}{\eta_i}\, \exp\left(-(\ln \eta_i - m_\eta)^2 / 2V_\eta\right). \qquad (8)$$

In this case,

$$m = m_\eta + V_\eta/2 \qquad (9)$$

and

$$V \equiv K\sigma^2 = \exp(V_\xi)\left(\exp(V_\eta) - 1\right), \qquad (10)$$

but for large $V_\xi$ the convergence to a Gaussian is an extremely slow process. Assuming that the convergence is achieved, one can analytically show [25] that P(g) behaves similarly to the Laplace distribution for small g, i.e. $P(g) \approx \exp(-\sqrt{2}\,|g|/\sqrt{V})/\sqrt{2V}$, while for large g P(g) has power law wings $P(g) \sim g^{-3}$, which are eventually truncated for $g \to \infty$ by the distribution $P_\eta$ of the growth rate of a single product.
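
Eqs. (9) and (10) are simple enough to evaluate directly. The sketch below does so and also returns the Gaussian-limit width $\sqrt{V/K}$. The function names are ours, and the parameter values in any call are up to the reader, since the text does not fix $V_\eta$.

```python
import math


def gaussian_limit_parameters(m_eta, v_eta, v_xi):
    """Return (m, V) of Eqs. (9) and (10) for lognormal unit sizes and growth."""
    m = m_eta + v_eta / 2.0
    v = math.exp(v_xi) * (math.exp(v_eta) - 1.0)
    return m, v


def sigma_large_k(k, v_eta, v_xi):
    """Width of P(g|K) in the Gaussian limit: sigma = sqrt(V / K)."""
    _, v = gaussian_limit_parameters(0.0, v_eta, v_xi)
    return math.sqrt(v / k)
```

Because V grows as $\exp(V_\xi)$, a broad distribution of unit sizes inflates the Gaussian-limit width, which is one way to see why the limit is reached only for very large K.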

To derive the size-variance relationship we must compute the conditional probability density of the growth rate P(g|S, K) of an economic system with K units and size S. For $K \to \infty$ the conditional probability density function P(g|S, K) develops a tent-shaped functional form, because in the center it converges to a Gaussian distribution with the width decreasing inversely proportionally to $\sqrt{K}$, while the tails are governed by the behavior of the growth distribution of a single unit, which remains wide independently of K.

We can also compute the conditional probability P(S|K), which is the convolution of K unit size distributions $P_\xi$. In the case of lognormal $P_\xi$ with a large logarithmic variance $V_\xi$ and mean $m_\xi$, the convergence of P(S|K) to a Gaussian is very slow (see Chapter II). Since P(S, K) = P(S|K)P(K), we can find

$$P(g|S) = \sum_{K} P(g|S, K)\, P(S|K)\, P(K), \qquad (11)$$

where all the distributions P(g|S, K), P(S|K), P(K) can be found from the parameters of the model. P(S|K) has a sharp maximum near $S = S_K \equiv K\mu_\xi$, where $\mu_\xi = \exp(m_\xi + V_\xi/2)$ is the mean of the lognormal distribution of the unit sizes. Conversely, P(S|K) as a function of K has a sharp maximum near $K_S = S/\mu_\xi$. For the values of S such that $P(K_S) \gg 0$, $P(g|S) \approx P(g|K_S)$, because P(S|K) serves as a $\delta(K - K_S)$ so that only terms with $K \approx K_S$ make a dominant contribution to the sum of Eq. (11). Accordingly, one can approximate P(g|S) by $P(g|K_S)$ and $\sigma(S)$ by $\sigma(K_S)$. However, all firms with $S < S_1 = \mu_\xi$ consist essentially of only one unit and thus

$$\sigma(S) = \sqrt{V_\eta} \qquad (12)$$

for $S < \mu_\xi$. For large S, if $P(K_S) > 0$,

$$\sigma(S) = \sigma(K_S) = \frac{\sqrt{V}}{\sqrt{K_S}} = \frac{\exp(3V_\xi/4 + m_\xi/2)\,\sqrt{\exp(V_\eta) - 1}}{\sqrt{S}}, \qquad (13)$$

where $m_\eta$ and $V_\eta$ are the logarithmic mean and variance of the unit growth distribution $P_\eta$ and $V = \exp(V_\xi)[\exp(V_\eta) - 1]$. Thus one expects to have a crossover from $\beta = 0$ for $S < \mu_\xi$ to $\beta = 1/2$ for $S \gg S^*$, where

$$S^* = \exp(3V_\xi/2 + m_\xi)\left(\exp(V_\eta) - 1\right)/V_\eta \qquad (14)$$

is the value of S for which Eq. (12) and Eq. (13) give the same value of $\sigma(S)$. Note that for small $V_\eta < 1$, $S^* \approx \exp(3V_\xi/2 + m_\xi)$. The range of crossover extends from $S_1$ to $S^*$, with $S^*/S_1 = \exp(V_\xi) \to \infty$ for $V_\xi \to \infty$. Thus in the double logarithmic plot of $\sigma$ vs. S one can find a wide region in which the slope $\beta$ slowly varies from 0 to 1/2 ($\beta \approx 0.2$), in agreement with many empirical observations.

The crossover to $\beta = 1/2$ will be observed only if $K^* \equiv S^*/\mu_\xi = \exp(V_\xi)$ is such that $P(K^*)$ is significantly larger than zero. For the distribution P(K) with a sharp exponential cutoff $K = K_0$, the crossover will be observed only if $K_0 \gg \exp(V_\xi)$.

Two scenarios are possible for $S > S_0 = K_0\mu_\xi$. In the first, there will be no economic system with $S \gg S_0$. In the second, if the distribution of the size of units $P_\xi$ is very broad, large economic systems can exist just because the size of a unit can be larger than $S_0$. In this case exceptionally large systems might consist of one extremely large unit $\xi_{\max}$, whose fluctuations dominate the fluctuations of the entire system.
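
The two asymptotes of Eqs. (12) and (13) and the crossover size of Eq. (14) can be checked numerically. The sketch below is a plain transcription of those formulas. The function names are ours, and any value of $V_\eta$ used with it is a hypothetical input, since the text does not report one.

```python
import math


def unit_mean_size(m_xi, v_xi):
    """mu_xi = exp(m_xi + V_xi / 2), the mean of the lognormal unit size."""
    return math.exp(m_xi + v_xi / 2.0)


def sigma_small(v_eta):
    """Eq. (12): single-unit regime, S < mu_xi."""
    return math.sqrt(v_eta)


def sigma_large(s, m_xi, v_xi, v_eta):
    """Eq. (13): sigma = sqrt(V / K_S) with K_S = S / mu_xi."""
    return math.sqrt(math.exp(1.5 * v_xi + m_xi) * (math.exp(v_eta) - 1.0) / s)


def crossover_size(m_xi, v_xi, v_eta):
    """Eq. (14): the size S* at which Eqs. (12) and (13) coincide."""
    return math.exp(1.5 * v_xi + m_xi) * (math.exp(v_eta) - 1.0) / v_eta
```

With the empirical values $m_\xi = 7.58$ and $V_\xi = 4.41$ quoted later in the text, the ratio $S^*/S_1$ is close to $\exp(4.41) \approx 82$ for small $V_\eta$, so the crossover region spans almost two decades in size.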

One can introduce the effective number of units in a system $K_e = S/\xi_{\max}$, where $\xi_{\max}$ is the largest unit of the system. If $K_e < 2$, we would expect that $\sigma(S)$ will again become equal to its value for small S given by Eq. (12), which means that under certain conditions $\sigma(S)$ will start to increase for very large economic systems and eventually become the same as for small ones.
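
The effective number of units is a one-line computation. The helper below is a sketch with a hypothetical name, and the example sizes in the usage note are invented for illustration.

```python
def effective_units(unit_sizes):
    """K_e = S / xi_max: total size divided by the size of the largest unit."""
    return sum(unit_sizes) / max(unit_sizes)
```

A system with unit sizes 100, 1 and 1 has three units but an effective number of units of only 1.02, so by the argument above it should fluctuate like a single-unit system.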

Whether such a scenario is possible depends on the complex interplay of $V_\xi$ and P(K). The crossover to $\beta = 1/2$ will be seen only if $P(K > K^*) > P(\xi > S^*)$, which means that such large systems predominantly consist of a large number of units. Taking into account the equation of $P_\xi$, one can see that $P(\xi > S^*) \sim \exp(-9V_\xi/8)$.

On the one hand, for an exponential P(K), this implies that

$$\exp\left(-\exp(V_\xi)/K_0\right) > \exp(-9V_\xi/8) \qquad (15)$$

or

$$V_\xi > 8\exp(V_\xi)/(9K_0). \qquad (16)$$

This condition is easily violated if $V_\xi \gg \ln K_0$. Thus for distributions P(K) with an exponential cut-off we will never see the crossover to $\beta = 1/2$ if $V_\xi \gg \ln K_0$.

On the other hand, for a power law distribution $P(K) \sim K^{-\varphi}$, the condition of the crossover becomes $\exp(V_\xi)^{1-\varphi} > \exp(-9V_\xi/8)$, or $(\varphi - 1)V_\xi < 9V_\xi/8$, which is rigorously satisfied for

$$\varphi < 17/8 \qquad (17)$$

but even for larger $\varphi$ is not dramatically violated. Thus for power law distributions, we expect a crossover to $\beta = 1/2$ for large S and a sufficiently large number N of economic entities in the data set: $N P(K^*) > 1$. The sharpness of the crossover mostly depends on $V_\xi$. For power law distributions we expect a sharper crossover than for exponential ones because the majority of the economic systems in a power law distribution have a small number of units K, and hence $\beta = 0$ almost up to $S^*$, the size at which the crossover is observed. For exponential distributions we expect a slow crossover which is interrupted if $V_\xi$ is comparable to $\ln K_0$. For $S \gg S_1$ this crossover is well represented by the behavior of $\sigma(K_S)$.
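
The conditions of Eqs. (16) and (17) can be turned into two small predicates. This is a sketch with hypothetical function names. It checks only the inequalities as written and says nothing about how sharp the crossover is.

```python
import math


def crossover_visible_exponential(v_xi, k0):
    """Eq. (16): condition for an exponential P(K) with cut-off K_0."""
    return v_xi > 8.0 * math.exp(v_xi) / (9.0 * k0)


def crossover_visible_power_law(phi):
    """Eq. (17): condition for a power law P(K) ~ K^(-phi)."""
    return phi < 17.0 / 8.0
```

For a cut-off $K_0 = 1000$ the condition holds at $V_\xi = 5$ but fails at $V_\xi = 10$, where $\exp(V_\xi)$ exceeds $K_0$ by more than an order of magnitude.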

We confirm these heuristic arguments by means of computer simulations. Figure [9] shows the behavior of $\sigma(S)$ for the exponential distribution $P(K) = \exp(-K/\langle K \rangle)/\langle K \rangle$ and lognormal $P_\xi$ and $P_\eta$. We show the results for $K_0 = \langle K \rangle = 1, 10, 100, 1000, 10000$ and $V_\xi = 1, 5, 10$. The graphs of $\sigma(K_S)$ and the asymptote given by Eq. (13) are also given to illustrate our theoretical considerations. One can see that for $V_\xi = 1$, $\sigma(S)$ almost perfectly follows $\sigma(K_S)$ even for $\langle K \rangle = 10$. However, for $V_\xi = 5$ the deviations become large and $\sigma(S)$ converges to $\sigma(K_S)$ only for $\langle K \rangle > 100$. For $V_\xi = 10$ the convergence is never achieved.

Figures [2] and [6] illustrate the importance of the effective number of units $K_e$. When $K_S$ becomes larger than $K_0$, $\sigma(S)$ starts to follow $\sigma(K_e)$. Accordingly, for very large economic systems $\sigma(S)$ becomes almost the same as for small ones. The maximal negative value of the slope $\beta_{\max}$ of the double logarithmic graphs presented in Fig. [2](a) corresponds to the inflection points of these graphs, and can be identified as an approximate value of $\beta$ for different values of $K_0$. One can see that $\beta_{\max}$ increases as $K_0$ increases, from a small value close to 0 for $K_0 = 10$ to a value close to 1/2 for $K_0 = 10^5$, in agreement with the predictions of the central limit theorem.

To further explore the effect of P(K) on the size-variance relationship we select P(K) to be a pure power law $P(K) \sim K^{-2}$ [Fig. [3](a)]. Moreover, we consider a realistic P(K) where K is the number of products by firms in the pharmaceutical industry [Fig. [3](b)]. As we have seen in Chapter II, this distribution can be well approximated by a Yule distribution with $\varphi = 2$ and an exponential cut-off for large K. Figure [3] shows that, for a scale-free power-law distribution P(K), in which the majority of firms are comprised of a small number of units but there is a significant fraction of firms comprised of an arbitrarily large number of units, the size-variance relationship depicts a steep crossover from $\sigma = \sqrt{V_\eta}$ given by Eq. (12) for small S to $\sigma = \sqrt{V/K_S}$ given by Eq. (13) for large S, for any value of $V_\xi$ (Riccaboni, 2008).

As we see, the size-variance relationship of economic systems $\sigma(S)$ can be well approximated by the behavior of $\sigma(K_S)$ [Fig. [2](a)]. It was shown in Buldyrev (2007) that, for realistic $V_\xi$, $\sigma(K)$ can be approximated in a wide range of K as $\sigma(K) \sim K^{-\beta}$ with $\beta \approx 0.2$, which eventually crosses over to $K^{-1/2}$ for large K. In other words, one can write $\sigma(K) \sim K^{-\beta(K)}$, where $\beta(K)$, defined as the slope of $\sigma(K)$ on a double logarithmic plot, increases from a small value dependent on $V_\xi$ at small K to 1/2 for $K \to \infty$. Accordingly, one can expect the same behavior for $\sigma(S)$ for $K_S < K_0$.

As $K_S$ approaches $K_0$, $\sigma(S)$ starts to deviate from $\sigma(K_S)$ in the upward direction. This results in a decrease of the slope $\beta(S)$ as $S \to \infty$, and one may not see the crossover to $\beta = 1/2$. Instead, in a quite large range of parameters $\beta$ can have an approximately constant value between 0 and 1/2.

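Thus it would be desirable to derive an exact analytical expression for $\sigma(K)$ in the case of lognormal and independent $P_\xi$ and $P_\eta$. Using the fact that the n-th moment of the lognormal distribution

$$P_x(x) = \frac{1}{\sqrt{2\pi V_x}}\, \frac{1}{x}\, \exp\left(-(\ln x - m_x)^2 / 2V_x\right), \qquad (18)$$

is equal to

$$\mu_{n,x} \equiv \langle x^n \rangle = \exp(n m_x + n^2 V_x/2), \qquad (19)$$

we can make an expansion of the logarithmic growth rate in inverse powers of K:

$$\begin{aligned}
g &= \ln\frac{\sum_{i=1}^{K} \xi_i \eta_i}{\sum_{i=1}^{K} \xi_i}
   = \ln \mu_{1,\eta} + \ln\left(1 + \frac{A}{K(1 + B/K)}\right) \\
  &= m_\eta + \frac{V_\eta}{2} + \frac{A(1 - B/K + B^2/K^2 \ldots)}{K} - \frac{A^2(1 - B/K + B^2/K^2 \ldots)^2}{2K^2} + \ldots \\
  &= m_\eta + \frac{V_\eta}{2} + \frac{A}{K} - \frac{AB + A^2/2}{K^2} + O(K^{-3}),
\end{aligned} \qquad (20)$$

where

$$A = \frac{\sum_{i=1}^{K} \xi_i(\eta_i - \mu_{1,\eta})}{\mu_{1,\xi}\,\mu_{1,\eta}}, \qquad B = \frac{\sum_{i=1}^{K} (\xi_i - \mu_{1,\xi})}{\mu_{1,\xi}}. \qquad (21)$$

Using the assumptions that $\xi_i$ and $\eta_i$ are independent, $\langle \xi_i \eta_i \rangle = \langle \xi_i \rangle \langle \eta_i \rangle$, $\langle \eta_i \eta_j \rangle = \langle \eta_i \rangle \langle \eta_j \rangle$, and $\langle \xi_i \xi_j \rangle = \langle \xi_i \rangle \langle \xi_j \rangle$ for $i \neq j$, we find $\langle A \rangle = 0$, $\langle AB \rangle = 0$, $\langle A^2 \rangle = CK$, where $C = a(b - 1)$ with $a = \exp(V_\xi)$ and $b = \exp(V_\eta)$. Thus

$$\mu \equiv \langle g \rangle = \sum_{n=0}^{\infty} \frac{m_n}{K^n}, \qquad
\sigma^2 \equiv \langle g^2 \rangle - \mu^2 = \sum_{n=1}^{\infty} \frac{V_n}{K^n}, \qquad (22)$$

where $m_0 = m_\eta + V_\eta/2$, $m_1 = -C/2$, $V_1 = C$, $V_2 = C[a(5b + 1)/2 - 1 - a^2 b(b + 1)]$. The higher terms involve terms like $\langle A^n \rangle / K^n$, which become sums of various products $\langle \xi_i^k (\eta_i - \mu_{1,\eta})^k \rangle$, where $2 \leq k \leq n$. The contribution from $k = n$ has exactly K terms of $\mu_{n,\xi}\,\mu_{1,\xi}^{-n} \sum_{j=0}^{n} \mu_{j,\eta}\,\mu_{1,\eta}^{-j} (-1)^{n-j} \binom{n}{j}$ with $\mu_{j,x}\,\mu_{1,x}^{-j} = \exp(V_x j(j - 1)/2)$. Thus there are contributions to $m_n$ and $V_n$ which grow as $(ab)^{n(n-1)/2}$ with $ab > 1$, which is faster than the n-th power of any $\lambda > 0$. Thus the radius of convergence of the expansions (22) is equal to zero, and these expansions have only a formal asymptotic meaning for $K \to \infty$. However, these expansions are useful since they demonstrate that $\mu$ and $\sigma$ do not depend on $m_\eta$ and $m_\xi$ except for the leading term in $\mu$: $m_0 = m_\eta + V_\eta/2$.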

Not being able to derive closed-form expressions for $\sigma$, we perform extensive computer simulations, where $\xi$ and $\eta$ are independent random variables taken from lognormal distributions $P_\xi$ and $P_\eta$ with different $V_\xi$ and $V_\eta$. The numerical results (Fig. [4]) suggest that

$$\ln\left[\sigma^2(K) K / C\right] \approx F_\sigma\left[\ln(K) - f(V_\xi, V_\eta)\right], \qquad (23)$$

where $F_\sigma(z)$ is a universal scaling function describing a crossover from $F_\sigma(z) \to 0$ for $z \to \infty$ to $F_\sigma(z)/z \to 1$ for $z \to -\infty$, and $f(V_\xi, V_\eta) \approx f_\xi(V_\xi) + f_\eta(V_\eta)$ are functions of $V_\xi$ and $V_\eta$ which have linear asymptotes for $V_\xi \to \infty$ and $V_\eta \to \infty$ [Fig. [4](b)].
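
A small Monte Carlo experiment of the kind described above can be sketched as follows. It estimates $\sigma^2(K)$ for lognormal unit sizes and growth factors and rescales it by $C/K$, the quantity whose logarithm appears on the left of Eq. (23). The function names, the sample size and the choice $m_\xi = m_\eta = 0$ are our assumptions. The last one is harmless because, as noted above, $\sigma$ does not depend on the logarithmic means.

```python
import math
import random


def simulate_sigma2(k, v_xi, v_eta, n_firms=3000, seed=0):
    """Sample variance of g = ln(sum(xi * eta) / sum(xi)) over n_firms firms of K units."""
    rng = random.Random(seed)
    sd_xi, sd_eta = math.sqrt(v_xi), math.sqrt(v_eta)
    rates = []
    for _ in range(n_firms):
        s_prev = 0.0
        s_next = 0.0
        for _ in range(k):
            xi = rng.lognormvariate(0.0, sd_xi)    # unit size, m_xi = 0 (assumption)
            eta = rng.lognormvariate(0.0, sd_eta)  # growth factor, m_eta = 0 (assumption)
            s_prev += xi
            s_next += xi * eta
        rates.append(math.log(s_next / s_prev))
    mean = sum(rates) / n_firms
    return sum((g - mean) ** 2 for g in rates) / (n_firms - 1)


def scaled_variance(k, v_xi, v_eta, n_firms=3000, seed=0):
    """sigma^2(K) * K / C with C = exp(V_xi) * (exp(V_eta) - 1)."""
    c = math.exp(v_xi) * (math.exp(v_eta) - 1.0)
    return simulate_sigma2(k, v_xi, v_eta, n_firms, seed) * k / c
```

For nearly equal unit sizes the scaled variance stays close to 1, the central-limit regime, while for a broad size distribution with $\exp(V_\xi) \gg K$ it falls well below 1, which is the slow convergence discussed in the text.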

Accordingly, we can try to define $\beta(z) = (1 - dF_\sigma/dz)/2$ [Fig. [5](a)]. The main curve $\beta(z)$ can be approximated by an inverse linear function of z when $z \to -\infty$ and by a stretched exponential as it approaches the asymptotic value 1/2 for $z \to +\infty$. The particular analytical shapes of these asymptotes are not known and are derived solely from least-squares fitting of the numerical data. The scaling for $\beta(z)$ is only approximate, with significant deviations from a universal curve for small K. The minimal value of $\beta$ practically does not depend on $V_\eta$ and is approximately inversely proportional to a linear function of $V_\xi$:

$$\beta_{\min} = \frac{1}{pV_\xi + q}, \qquad (24)$$

where $p \approx 0.54$ and $q \approx 2.66$ are universal values [Fig. [3](b)]. This finding is significant for our study, since it indicates that near its minimum, $\beta(K)$ has a region of approximate constancy with the value $\beta_{\min}$ between 0.14 and 0.2 for $V_\xi$ between 4 and 8. These values of $V_\xi$ are quite realistic and correspond to a distribution of unit sizes spanning roughly two to three orders of magnitude (68% of all units), which is the case in the majority of economic and ecological systems. Thus our study provides a reasonable explanation for the abundance of values of $\beta \approx 0.2$.
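
Eq. (24) can be evaluated directly to confirm the quoted range. The function name below is ours, and the default values of p and q are the fitted constants reported in the text.

```python
def beta_min(v_xi, p=0.54, q=2.66):
    """Eq. (24): beta_min = 1 / (p * V_xi + q), with the fitted p and q from the text."""
    return 1.0 / (p * v_xi + q)
```

For $V_\xi = 4$ this gives about 0.207 and for $V_\xi = 8$ about 0.143, consistent with the stated interval between 0.14 and 0.2.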

The above analysis shows that $\sigma(S)$ is not a true power-law function, but undergoes a crossover from $\beta = \beta_{\min}(V_\xi)$ for small economic systems to $\beta = 1/2$ for large ones. However, this crossover is expected only for very broad distributions P(K). If it is very unlikely to find an economic complex with $K > K_0$, $\sigma(S)$ will start to grow for $S > K_0\mu_\xi$. Empirical data do not show such an increase (Fig. [7]), because in reality there are few giant entities which rely on a few extremely large units. These entities are extremely volatile and hence unstable. Therefore for real data we see neither a crossover to $\beta = 1/2$ nor an increase of $\sigma$ for large economic systems.



III. THE EMPIRICAL EVIDENCE 



Since the size-variance relationship depends on the partition of firms into their constituent components, to properly test our model one must decompose an economic system into parts. In this section we analyze the pharmaceutical industry database, which covers the whole size distribution for products and firms and monitors flows of entry and exit at every level of aggregation. Products are classified by companies, markets and international brand names, with different distributions P(K) with $\langle K \rangle = K_0$ ranging from 5.8 for international products to almost 1,600 for markets [Tab. [I]]. If firms have on average $K_0$ products and $V_\xi \ll \ln K_0$, the scaling variable $z = \ln K_0 - f(V_\xi, V_\eta)$ is positive and we expect $\beta \to 1/2$. On the contrary, if $V_\xi \gg \ln K_0$, $z < 0$ and we expect $\beta \to 0$. These considerations work only for a broad distribution P(K) with mild skewness, such as an exponential distribution. At the opposite extreme, if all companies have the same number of products, the distribution of S is narrowly concentrated near the most probable value $S_0 = \mu_\xi K_0$ and there is no reason to define $\beta(S)$. Only very rarely is $S \gg S_0$, due to the low probability of observing an extremely large product which dominates the fluctuation of a firm. Such a firm is more volatile than other firms of equal size. This would imply negative $\beta$. If P(K) is power law distributed, there is a wide range of values of K, so that there are always firms for which $\ln K \gg V_\xi$ and we can expect a slow crossover from $\beta = 0$ for small firms to $\beta = 1/2$ for large firms, so that for a wide range of empirically plausible $V_\xi$, $\beta$ is far from 1/2 and statistically different from 0. The estimated value of the size-variance scaling coefficient $\beta$ ranges from 0.123 for products to 0.243 for therapeutic markets, with companies in the middle (0.188) [Tab. [I] and
Fig. E|. 

Our model relies upon general assumptions of independence of the growth of economic 
entities from each other and from the number of units K. However, these assumptions could 
be violated and at least three alternative explanations must be analyzed: 

1. Size dependence. The probability that an active firm captures a new market opportunity is more or less than proportional to its current size. In particular, there could be a positive relationship between the number of products of firm $\alpha$ ($K_\alpha$) and the size ($\xi_i(\alpha)$) and growth ($\eta_i(\alpha)$) of its component parts due to monopolistic effects and economies of scale and scope. If large and small companies do not get access to the same distribution of market opportunities, large firms can be riskier than small firms simply because they tend to capture bigger opportunities.

2. Units interdependence. The growth processes of the constituent parts of a firm are not independent. One could expect product growth rates to be positively correlated at the level of firm portfolios, due to product similarities and common management, and negatively correlated at the level of relevant markets, due to substitution effects and competition. Based on these arguments, one would predict large companies to be less risky than small companies because their product portfolios tend to be more diversified.

3. Time dependence. The growth of firms' constituent units does not follow a pure Gibrat process due to serial auto-correlation and lifecycles. Young products and firms are supposed to be more volatile than predicted by Gibrat's Law due to learning effects. If large firms are older and have more mature products, they should be less risky than small firms. On the contrary, ageing and obsolescence would imply that incumbent firms are more unstable than newcomers.

The first two hypotheses are not falsified by our data (Fig. [10]).

The number of products of a firm and their average size, defined as $\langle \xi(K) \rangle = \langle \frac{1}{K} \sum_{i=1}^{K} \xi_i \rangle$, where $\langle\,\rangle$ indicates averaging over all companies with K products, have an approximate power law dependence $\langle \xi(K) \rangle \sim K^{\gamma}$, where $\gamma = 0.38$.

The mean correlation coefficient of product growth rates at the firm level $\langle \rho(K) \rangle$ shows an approximate power law dependence $\langle \rho(K) \rangle \sim K^{\zeta}$, where $\zeta = -0.36$.

Since larger firms are composed of bigger products and are more diversified than small firms, the two effects compensate each other. Thus if products are randomly reassigned to companies, the size-variance relationship will not change.

As for the time dependence hypothesis, although there are some departures from a Gibrat process at the product level (Fig. [11]) due to lifecycles and seasonal effects, they are too weak to account for the size-variance relationship. Moreover, asynchronous product lifecycles are washed out upon aggregation.

To discriminate among different plausible explanations we run a set of experiments in which we keep the real P(K) and randomly reassign products to firms. In the first simulation we randomly reassign products while keeping the real-world relationship between the size, $\xi$, and growth, $\eta$, of products. In the second simulation we also reassign $\eta$. Finally, in the last simulation we generate elementary units according to a geometric Brownian motion (Gibrat process) with empirically estimated values of the mean and variance of $\xi$ and $\eta$. Tab. [I] summarizes the results of our simulations.
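
The first experiment, reassigning products to firms while keeping the real P(K), amounts to shuffling the firm labels of the products. The sketch below illustrates this under our own assumptions: products are identified by their position in a list, `firm_of_product[i]` is the label of the firm holding product i, and both function names are hypothetical.

```python
import random
from collections import Counter


def reassign_products(firm_of_product, seed=0):
    """Shuffle firm labels across products; each firm keeps its number of products K."""
    rng = random.Random(seed)
    labels = list(firm_of_product)   # copy, so the input is left untouched
    rng.shuffle(labels)
    return labels


def units_per_firm(firm_of_product):
    """Number of products K held by each firm."""
    return Counter(firm_of_product)
```

Because only the labels move, each product keeps its own size and growth history, which is what the first simulation requires. Reassigning growth rates as well, as in the second simulation, would need a separate shuffle of the growth series.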

The first simulation allows us to check the size dependence and unit interdependence hypotheses by randomly reassigning elementary units to firms and markets. In doing so, we keep the number of products in each class and the history of the fluctuations of each product's sales unchanged. As for the size dependence, our analysis shows that there is indeed a strong correlation between the number of products in the company and their average size, defined as

$$\langle \xi(K) \rangle = \left\langle \frac{1}{K} \sum_{i=1}^{K} \xi_i \right\rangle, \qquad (25)$$

where $\langle\,\rangle$ indicates averaging over all companies with K products. We observe an approximate power law dependence $\langle \xi(K) \rangle \sim K^{\gamma}$, where $\gamma = 0.38$. If this were a true asymptotic power law holding for $K \to \infty$, then the average size of a company of K products would be proportional to $\langle \xi(K) \rangle K \sim K^{1+\gamma}$. Accordingly, the average number of products in a company of size S would scale as $K(S) \sim S^{1/(1+\gamma)}$ and consequently, due to the central limit theorem, $\beta = 1/(2 + 2\gamma)$. In our database, this would mean that the asymptotic value of $\beta$ is 0.36. Similar logic was used to explain $\beta$ in [15], [19]. Another effect of the random redistribution of units will be the removal of possible correlations among the $\eta_i$ in a single firm (unit interdependence). Removal of positive correlations would decrease $\beta$, while removal of negative correlations would increase $\beta$. The mean correlation coefficient of the product growth rates at the firm level $\langle \rho(K) \rangle$ also has an approximate power law dependence $\langle \rho(K) \rangle \sim K^{\zeta}$, where $\zeta = -0.36$. Since larger firms have bigger products and are more diversified than small firms, the size dependence and unit interdependence cancel out and $\beta$ practically does not change if products are randomly reassigned to firms.
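
The arithmetic behind the asymptotic value quoted above is a one-liner. The function name is ours, and the formula is the one stated in the text, $\beta = 1/(2 + 2\gamma)$.

```python
def beta_from_gamma(gamma):
    """Asymptotic exponent implied by <xi(K)> ~ K^gamma: beta = 1 / (2 + 2 * gamma)."""
    return 1.0 / (2.0 + 2.0 * gamma)
```

With $\gamma = 0.38$ the result is approximately 0.362, which rounds to the value 0.36 given in the text, and with $\gamma = 0$ (no size dependence) it recovers the central-limit value 1/2.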

                          N          $K_0$      $\beta$   $\beta^*_1$   $\beta^*_2$   $\beta^*_3$

Markets                   574        1,596.9    0.243     0.213         0.232         0.221

Firms                     7,184      127.5      0.188     0.196         0.125         0.127

International Products    189,302    5.8        0.151     0.175         0.038         0.020

All Products              916,036    -          0.123     0.123

TABLE I: The size-variance relationship $\sigma(S) \sim S^{-\beta(S)}$: estimated values of $\beta$ and simulation results $\beta^*$ at different levels of aggregation from products to markets. In simulation 1 ($\beta^*_1$) products are randomly reassigned to firms and markets. In simulation 2 ($\beta^*_2$) the growth rates of products are reassigned too. In simulation 3 ($\beta^*_3$) we reproduce our model with real P(K) and estimated values of $m_\xi = 7.58$ and $\sqrt{V_\xi} = 2.10$.

To control for the effect of time dependence, we keep the sizes of products $\xi_i$ and their number $K_\alpha$ at year t for each firm $\alpha$ unchanged, so that $S_t = \sum_{i=1}^{K_\alpha} \xi_i$ is the same as in the empirical data. However, to compute the sales of a firm in the following year, $S'_{t+1} = \sum_{i=1}^{K_\alpha} \xi'_i$, we assume that $\xi'_i = \xi_i \eta'_i$, where $\eta'_i$ is the annual growth rate of a randomly selected product. The surrogate growth rate $g' = \ln(S'_{t+1}/S_t)$ obtained in this way does not display any size-variance relationship at the level of products ($\beta^*_2 = 0$). However, we still observe a size-variance relationship at higher levels of aggregation. This test demonstrates that 1/3 of the size-variance relationship depends on the growth process at the level of elementary units, which is not a pure Gibrat process. However, asynchronous product lifecycles are washed out upon aggregation and there is a persistent size-variance relationship which is not due to product auto-correlation.

Finally, we reproduce our model with the empirically observed P(K) and the estimated moments of the lognormal distribution of products ($m_\xi = 7.58$, $V_\xi = 4.41$). We generate N random products according to our model (Gibrat process) with the empirically observed levels of $V_\xi$ and $m_\xi$. As we can see in Tab. [I], our model closely reproduces the values of $\beta$ at any level of aggregation. We conclude that a model of proportional growth in both the number and the size of economic units correctly predicts the size-variance relationship and the way it scales under aggregation.
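
The time-dependence control described above, in which product sizes are kept and each product receives the growth rate of a randomly selected product, can be sketched as follows. The data layout (a list of per-firm lists of product sizes and a flat pool of observed growth factors) and the function name are our assumptions, and sampling from the pool with replacement is one possible reading of "randomly selected".

```python
import math
import random


def surrogate_growth_rates(firm_unit_sizes, growth_pool, seed=0):
    """Surrogate g' = ln(S'_{t+1} / S_t) with growth factors drawn from a common pool.

    firm_unit_sizes: list of lists, the product sizes of each firm at year t.
    growth_pool: observed annual growth factors of all products.
    """
    rng = random.Random(seed)
    rates = []
    for sizes in firm_unit_sizes:
        s_t = sum(sizes)  # S_t is kept exactly as in the data
        s_next = sum(xi * rng.choice(growth_pool) for xi in sizes)
        rates.append(math.log(s_next / s_t))
    return rates
```

Drawing the growth factor independently of the product removes any link between a product's size, age or firm and its growth, so whatever size-variance relationship survives in the surrogate rates comes from aggregation alone.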

The variance of the size of the constituent units of the firm, $V_\xi$, and the distribution of units
into firms are both relevant to explaining the size-variance relationship of firm growth rates.
Simulation results in Fig. 7 reveal that if elementary units are of the same size ($V_\xi = 0$),
the central limit theorem will work properly and $\beta \approx 1/2$. As predicted by our model, by
increasing the value of $V_\xi$ we observe at any level of aggregation the crossover of $\beta$ from 1/2
to 0. The crossover is faster at the level of markets than at the level of products due to the
higher average number of units per class $K_0$. However, in real-world settings the central
limit theorem never applies because firms have a small number of components of variable
size ($V_\xi > 0$). For empirically plausible values of $V_\xi$ and $K$, $\beta \approx 0.2$.
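The dependence of $\beta$ on $V_\xi$ can be explored with a small Monte Carlo experiment. The sketch below is an assumption-laden illustration rather than the simulation behind Fig. 7: it replaces the real $P(K)$ with a geometric distribution of mean 20, sets $m_\xi = m_\eta = 0$, uses a growth-rate variance of 0.36, and fits $\beta$ by regressing log standard deviation on log size over decile bins. The function name `simulate_beta` and all parameter defaults are hypothetical.

```python
import numpy as np


def simulate_beta(v_xi, n_firms=20_000, k_mean=20.0, v_eta=0.36, seed=0):
    """Fit beta in sigma(S) ~ S^(-beta) for firms built from lognormal units."""
    rng = np.random.default_rng(seed)
    k = rng.geometric(1.0 / k_mean, size=n_firms)            # units per firm (assumed P(K))
    firm_id = np.repeat(np.arange(n_firms), k)               # firm index of every unit
    xi = rng.lognormal(0.0, np.sqrt(v_xi), size=k.sum())     # unit sizes, m_xi = 0
    eta = rng.lognormal(0.0, np.sqrt(v_eta), size=k.sum())   # unit growth factors, m_eta = 0
    s_t = np.bincount(firm_id, weights=xi, minlength=n_firms)
    s_next = np.bincount(firm_id, weights=xi * eta, minlength=n_firms)
    g = np.log(s_next / s_t)

    # Standard deviation of g within size deciles, then a log-log least-squares fit.
    log_s = np.log(s_t)
    edges = np.unique(np.quantile(log_s, np.linspace(0.0, 1.0, 11)))
    bin_id = np.digitize(log_s, edges[1:-1])
    xs, ys = [], []
    for b in range(len(edges) - 1):
        sel = bin_id == b
        if sel.sum() >= 50:
            xs.append(log_s[sel].mean())
            ys.append(np.log(g[sel].std()))
    return -np.polyfit(xs, ys, 1)[0]


betas = {v: simulate_beta(v) for v in (0.0, 1.0, 4.0, 9.0)}
for v, beta in betas.items():
    print(f"V_xi = {v:4.1f}  ->  beta = {beta:.2f}")
```

With equal-sized units the fitted exponent should sit near 1/2, and it should fall as $V_\xi$ grows, because firm size is then dominated by a few large units and the averaging over units becomes ineffective.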



IV. DISCUSSION 



Firms grow over time as the economic system expands and new investment opportunities
become available. To capture new business opportunities, firms open new plants and launch
new products, but the revenues and the returns on these investments are uncertain. If revenues
were independent random variables drawn from a Gaussian distribution with mean $m_e$ and
variance $V_e$, one should expect that the standard deviation of the sales growth rate of a firm
with $K$ products will be $\sigma(S) \sim S^{-\beta(S)}$ with $\beta = 1/2$ and $S = m_e K$. On the contrary, if the
size of business opportunities is given by a geometric Brownian motion (Gibrat's process) and
revenues are independent random variables drawn from a lognormal distribution with mean
$m_\xi$ and variance $V_\xi$, the central limit theorem does not work effectively and $\beta(S)$ exhibits a
crossover from $\beta = 0$ for $S \to 0$ to $\beta = 1/2$ for $S \to \infty$. For realistic distributions of the
number and size of business opportunities, $\beta(S)$ is approximately constant, as it varies in
the range from 0.14 to 0.2 depending on the average number of units in the firm $K$ and
the variance of the size of business opportunities $V_\xi$. This implies that a firm of size $S$ is
expected to be riskier than the sum of $S$ firms of size 1, even in the case of constant returns
to scale and independent business opportunities.
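The last statement can be made concrete with a short worked step. It assumes that a single firm follows $\sigma(S) = \sigma_1 S^{-\beta}$ and that the aggregate of $S$ independent firms of size 1 follows the central-limit scaling $\sigma_1 S^{-1/2}$ (approximately, for small fluctuations); $\sigma_1$, the growth-rate standard deviation at unit size, is a symbol introduced here only for illustration.

```latex
\frac{\sigma_{\mathrm{firm}}(S)}{\sigma_{\mathrm{sum}}(S)}
  = \frac{\sigma_1 S^{-\beta}}{\sigma_1 S^{-1/2}}
  = S^{1/2-\beta},
\qquad
\beta = 0.2,\; S = 100
\;\Rightarrow\;
100^{0.3} \approx 3.98 .
```

For any $\beta$ in the range 0.14 to 0.2 the exponent $1/2 - \beta$ is positive, so under these assumptions the ratio grows with $S$.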
FIG. 1: Schematic representation of the model of proportional growth. At time $t = 0$, there
are $N(0) = 2$ classes (□) and $n(0) = 5$ units (○) (Assumption A1). The area of each circle
is proportional to the size $\xi$ of the unit, and the size of each class is the sum of the areas of
its constituent units (see Assumption B1). At the next time step, $t = 1$, a new unit is created
(Assumption A2). With probability $b$ the new unit is assigned to a new class (class 3 in this
example) (Assumption A3). With probability $1 - b$ the new unit is assigned to an existing class
with probability proportional to the number of units in the class (Assumption A4). In this example,
a new unit is assigned to class 1 with probability 3/5 or to class 2 with probability 2/5. Finally,
at each time step, each circle $i$ grows or shrinks by a random factor $\eta_i$ (Assumption B2).

FIG. 2: (a) Simulation results for $\sigma(S)$ according to Eq. (Ill|) for exponential $P(K) =
\exp(-K/K_0)/K_0$ with $K_0 = 10, 10^2, 10^3, 10^4, 10^5$ and lognormal $P_\xi$ and $P_\eta$ with $V_\xi = 5.13$, $m_\xi =
3.44$, $V_\eta = 0.36$, $m_\eta = 0.016$ computed for the pharmaceutical database. One can see that, for small
enough $S$ and for different $K_0$, $\sigma(S)$ follows a universal curve which can be well approximated with
$\sigma(K_S)$, with $K_S = S/\mu_\xi \approx S/405$. For $K_S > K_0$, $\sigma(S)$ departs from the universal behavior and
starts to increase. This increase can be explained by the decrease of the effective number of units
$K_e(S)$ for the extremely large firms. The maximal negative slope $\beta_{\max}$ increases as $K_0$ increases, in
agreement with the predictions of the central limit theorem. (b) One can see that $K_e(S)$ reaches
its maximum around $S \approx K_0 \mu_\xi$. The positions of maxima in $K_e(S)$ coincide with the positions of
minima in $\sigma(S)$.







FIG. 3: Size-variance relationship $\sigma(S)$ for various $V_\xi$ with $P(K) \sim K^{-2}$ (a) and real $P(K)$ (b). A
sharp crossover from $\beta = 0$ to $\beta = 1/2$ is seen for the power-law distribution even for large values
of $V_\xi$. In the case of the real $P(K)$ one can see wide crossover regions in which $\sigma(S)$ can be approximated
by a power-law relationship with $0 < \beta < 1/2$. Note that the slope of the graphs ($\beta$) decreases
with the increase of $V_\xi$. The graphs of $\beta(K_S)$ and their asymptotes are also shown with squares
and circles, respectively.

FIG. 4: (a) Simulation results for $\sigma^2(K)$ in the case of lognormal $P_\xi$ and $P_\eta$ and different $V_\xi$ and $V_\eta$,
plotted on a universal scaling plot as a function of the scaling variable $z = \ln(K) - f(V_\xi, V_\eta)$. (b) The
shift function $f(V_\xi, V_\eta)$. The graph shows that $f(V_\xi, V_\eta) \approx f_\xi(V_\xi) + f_\eta(V_\eta)$. Both $f_\xi(V_\xi)$ and $f_\eta(V_\eta)$
(inset) are approximately linear functions.

FIG. 5: (a) The effective exponent $\beta(z)$ obtained by differentiation of $\sigma^2(z)$ plotted in Fig. 4(a).
Solid lines indicate least-squares fits for the left and right asymptotes. The graph shows significant
deviations of $\beta(K, V_\xi, V_\eta)$ from a universal function $\beta(z)$ for small $K$, where $\beta(K)$ develops minima.
(b) The dependence of the minimal value $\beta_{\min}$ on $V_\xi$. One can see that this value practically
does not depend on $V_\eta$ and is inversely proportional to a linear function of $V_\xi$.

FIG. 6: The standard error of firm growth rates ($\sigma$) (circles), and the share of the largest products
($1/K_e$) (squares) versus the size of the firm ($S$). As predicted by our model, for $S < S_1 = \mu_\xi \approx 3.44$,
$\beta \approx 0$. For $S > S_1$, $\beta$ increases but never reaches 1/2 due to the slow growth of the effective number
of products ($K_e$). The flattening of the upper tail is due to some large companies with unusually
large products.

FIG. 7: The scaling of the size-variance relationship as a function of $V_\xi$. $\beta$ decays rapidly from 1/2
to 0 for $V_\xi \to \infty$. In the simulation we keep the real $P(K)$ for products, companies and markets
and assign products drawn from a lognormal distribution with the empirically observed mean $m_\xi$
and variance $0 < V_\xi < 25$.

FIG. 8: Simulation results for the conditional growth rate distribution $P(g|S,K)$ for the case of
lognormal $P_\xi$ and $P_\eta$, with $V_\xi = 6$, $V_\eta = 1$ and $m_\xi = m_\eta = 0$. For $K = 1$ the distribution is
perfectly Gaussian with $V_\eta = 1$ and $m_\eta = 0$. However, for large $K$ the distribution develops a
tent-shaped form with the central part close to a Gaussian with mean $m = 1/2$ as predicted by Eq.
The vast majority of firms (99.7%) have sizes in the vicinity of $K\mu_\xi$, which for $K = 2^{15}$ and
$\mu_\xi = \exp(m_\xi + V_\xi/2) = 20.1$ belongs to the bin $[2^{19}, 2^{20}]$, and only 0.25% of firms belong to the
next bin $[2^{20}, 2^{21}]$. These firms are due to a rare occurrence of extremely large products. The effective
number of products in these firms is $K_e = 2.4$, while the normally sized firms have $K_e = 31$. The
fluctuations of these extremely large products dominate the fluctuations of the firm size and hence
$P(g|S, K)$ for such abnormally large firms is broader than for normally sized firms. Accordingly,
$\sigma = 0.09$ and $\sigma = 0.41$ for the normally sized and the abnormally large firms, respectively.

FIG. 9: The behavior of $\sigma(S)$ for the exponential distribution $P(K) = \exp(-K/\langle K \rangle)/\langle K \rangle$ and
lognormal $P_\xi$ and $P_\eta$. We show the results for $\langle K \rangle = 1, 10, 100, 1000, 10000$ and $V_\xi = 1, 5, 10$.
The graphs $\sigma(K_S)$ and the asymptote given by $\sigma(S) = \sqrt{V/K_S} = \exp(3V_\xi/4 + m_\xi/2)\sqrt{[\exp(V_\eta) - 1]/S}$ are
also given to illustrate our theoretical considerations. One can see that for $V_\xi = 1$, $\sigma(S)$ almost
perfectly follows $\sigma(K_S)$ even for $\langle K \rangle = 10$. However, for $V_\xi = 5$ the deviations become large and
$\sigma(S)$ converges to $\sigma(K_S)$ only for $\langle K \rangle > 100$. For $V_\xi = 10$ the convergence is never achieved.



FIG. 10: (a) The relationship between the average product size and the number of products of
the firm. The log-log plot of $\langle \xi(K) \rangle$ vs. $K$ shows a power-law dependence $\langle \xi(K) \rangle \sim K^{0.38}$. (b) The
relationship between the mean correlation coefficient of product growth rates and the number of
products of a firm. The log-log plot of $\langle \rho(K) \rangle$ vs. $K$ shows a power-law dependence $\langle \rho(K) \rangle \sim K^{-0.36}$.

FIG. 11: (a) The average growth rate and the auto-correlation coefficient of products since launch.
Product growth tends to be higher in the first two years from entry. We detect seasonal cycles
and a weak (not significant) negative auto-correlation. (b) The average growth rate and the
auto-correlation coefficient of firms from entry. The departures of product growth from a Gibrat process
are washed out upon aggregation. The growth rates do not depend on age and do not show a
significant auto-correlation.